README "Replication Data for: Reducing Parent-School Information Gaps and Improving Education Outcomes: Evidence from High-Frequency Text Messages" (By Berlinski, S., M. Busso. T. Dinkelman and Martinez C.)


Files

1. "datasms_final.dta": This dataset includes the variables used for all main and appendix Tables and Figures except for Figures 4 and 5 and Appendix Tables 4 and 5. It includes administrative and surveys data. 
It is in long format (at the student-date level, where date is baseline, midline and follow-up). For surveys, baseline is beginning of 2014 and and for administrative data is 2013 (variables year_admin and year_surveys details this).

2. "datasms_monthly.dta": This dataset includes the variables used for Figure 5 and Appendix Table 5. It is in long format (at the student-year-month level).

3. "attendance_daily.dta": This dataset includes the variables used for Figures 4 and Appendix Table 4. It is in long format (at the student-year-month-day level). It includes daily attendance information.

4. "scripts": This folder contains the main dofile (results.do) that replicates the tables and figures of the paper using as input previously presented datasets. Please see notes at the beginning of each table/figure in dofile for more information. It also contains auxiliary ado files that are called by the main do file.
